Exploring Recovery from Operating System Lockups
نویسندگان
چکیده
Operating system lockup errors can render a computer unusable by preventing the execution other programs. Watchdog timers can be used to recover from a lockup by resetting the processor and rebooting the system when a lockup is detected. This results in a loss of unsaved data in running programs. Based on the observation that volatile memory is not affected when a processor a reset occurs, we present an approach to recover from a watchdog reset with minimal or zero loss of application state. We study the resolution of lockup conditions using thread termination and using exception dispatch. Thread termination can still result in a usable system and is already used as a recovery strategy for other errors in Linux. Using exceptions allows developers to write code to handle a lockup within the erroneous thread and attempt application transparent recovery. Fault injection experiments show that a significant percentage of lockups can be recovered by thread termination. Exception handling further improves the recoverability of the operating system.
منابع مشابه
The role of lockups in takeover contests
This paper examines breakup fees and stock lockups as devices for prospective target firms to encourage bidder participation in takeover contest. We show that, unless bidding costs for the first bidder are too high, breakup fees provide for the socially desirable degree of competition and ensure the efficient allocation of the target to the highest valued buyer in a takeover auction. In contras...
متن کاملبررسی تراکم گاز بیهوشی نایتروس اکساید(O2N) در هوای اتاقهای عمل جراحی و ریکاوری
Chronic exposure to N2O environmental pollution may influence the health of personnel working in operating and recovery rooms. Human studies have indicated that chronic exposure to N2O may decrease mental performance, audiovisual ability, and manual dexterity and may also cause adverse reproductive effects like reduced fertility, spontaneous abortion and neurological, renal, and liv...
متن کاملEnergy Management Simulation in a PEM Fuel Cell System
In this research the simulation of an air independent Proton Exchange Membrane Fuel Cell (PEMFC) propulsion system was taken into consideration. The system consists of several parts including PEM fuel cell stack, metal hydride and liquid oxygen (LOX) tanks, and also pre-heaters of oxygen and hydrogen gases along with other heat exchangers, to ensure proper operation of system. The heat recovery...
متن کاملEnhanced I/O subsystem recovery and availability on the IBM System z9
Although part of the IBM System ze strategy is to improve design and development processes to prevent errors from escaping to the field, improving recovery is another element in the strategy to keep a machine up and running should an error occur. The z9e continues on an evolutionary path of enhancing I/O subsystem (IOSS) recovery to further advance the reliability, availability, and serviceabil...
متن کاملNew Markov Chain Models to Estimate the Premium for Extended Hedge Fund Lockups
To estimate the premium an investor should expect from extended hedge fund lockups, Derman et al. (2009) proposed a three-state discrete-time Markov Chain to model the state of a hedge fund, allowing the state to change randomly among the states “good,” “sick” and “dead” every year. The lockup premium measures the consequence of being stuck with a sick fund. To be more realistic, we propose an ...
متن کامل